[CI] Add new runner and enable QwQ multinpu test #417
    tensor_parallel_size=4,
    distributed_executor_backend=distributed_executor_backend,
) as vllm_model:
    vllm_model.generate_greedy(example_prompts, max_tokens)
I remember from our last offline discussion that we also wanted to add transformers as a comparison. Does that work (and work stably)?
I'll finish that in the next pull request.
fi
pip install /root/.cache/pta/torch_npu-2.5.1.dev20250320-cp310-cp310-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
- name: Run vllm-project/vllm-ascend test
V1 engine should be tested as well.
I don't think we should test V1 like this. V1 doesn't fully support some features yet. In addition, we should mock VLLM_USE_V1 in the relevant test modules where needed.
Why? We only test V1 for the vllm-ascend tests, not the vllm tests.
I mean we will construct a pytest fixture to decide which cases need to be tested on V1.
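One way such a fixture could look is sketched below. This is not the code from this PR: the allow-list `V1_SUPPORTED_CASES`, the helper `engines_for`, and the test name `test_models_distributed` are hypothetical names chosen for illustration. The sketch assumes it lives in a `conftest.py`, where the `pytest_generate_tests` hook parametrizes the `use_v1` fixture per test.

```python
import pytest

# Hypothetical allow-list of test names believed to work on the V1 engine.
V1_SUPPORTED_CASES = {"test_models_distributed"}


def engines_for(test_name: str) -> list[str]:
    """Return the VLLM_USE_V1 values a given test should run with."""
    return ["0", "1"] if test_name in V1_SUPPORTED_CASES else ["0"]


def pytest_generate_tests(metafunc):
    # Only tests that request the `use_v1` fixture are parametrized;
    # cases outside the allow-list run on V0 only.
    if "use_v1" in metafunc.fixturenames:
        metafunc.parametrize(
            "use_v1", engines_for(metafunc.function.__name__), indirect=True
        )


@pytest.fixture
def use_v1(request, monkeypatch):
    # Set the variable for one test; monkeypatch restores it on teardown.
    monkeypatch.setenv("VLLM_USE_V1", request.param)
    return request.param
```

A test opts in by taking `use_v1` as an argument; tests that do not request it are left untouched.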
env:
VLLM_USE_V1: 1
VLLM_WORKER_MULTIPROC_METHOD: spawn
VLLM_USE_V1: 0
Why was the V1 test removed?
It's added here instead: @pytest.mark.parametrize("use_v1", ["1", "0"])
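As a minimal sketch of how that decorator can drive the engine switch from inside the test rather than from the workflow `env:` block: the test name `test_engine_flag` and its body are placeholders, not the test in this PR. Only `pytest.mark.parametrize` and the built-in `monkeypatch` fixture are relied on.

```python
import os

import pytest


@pytest.mark.parametrize("use_v1", ["1", "0"])
def test_engine_flag(use_v1, monkeypatch):
    # Set VLLM_USE_V1 for this test only; monkeypatch undoes it on teardown.
    monkeypatch.setenv("VLLM_USE_V1", use_v1)
    # A real test would construct the model and generate here (placeholder).
    assert os.environ["VLLM_USE_V1"] == use_v1
```

pytest collects this as two cases, one per value, so both engines are exercised without changing the CI environment.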
Signed-off-by: wangli <wangli858794774@gmail.com>
@@ -1,5 +1,8 @@
[pytest]
minversion = 6.0
markers =
These markers are useless now, right?
These markers help pytest detect which kinds of cases should be run, for example with pytest -m 'multinpu'.
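To illustrate the selection mechanism, here is a small sketch. It assumes a `multinpu` marker is registered under `markers` in `pytest.ini`; the two test functions are hypothetical placeholders.

```python
import pytest


@pytest.mark.multinpu
def test_needs_several_npus():
    # Placeholder for a case that requires more than one device.
    pass


def test_single_device():
    # Unmarked case: deselected by `pytest -m multinpu`.
    pass
```

Running `pytest -m multinpu` selects only the marked test, while `pytest -m "not multinpu"` runs the rest, which lets a single-device runner and a multi-device runner share one test tree.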
I'll fix the nit in the follow-up PR.
### What this PR does / why we need it?
- Add a new runner to the continuous integration system and keep the original CI runner until the new runner runs stably
- Add distributed test cases

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
CI passed

---------

Signed-off-by: wangli <wangli858794774@gmail.com>